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EXECUTIVE  SUMMARY 


The  Environmental  Medicine  Genome  Bank  (EMGB)  project  is  an  ongoing  effort 
to  identify  and  characterize  genes  relevant  to  environmental  illnesses  and  to  human 
physical  performance.  To  accomplish  this,  the  EMGB  banks  DNA  samples  from  human 
volunteers  who  have  participated  in  environmental  and  human  performance  studies  or 
material  obtained  under  approved  Brigham  and  Women’s  Hospital  protocols  that  would 
otherwise  have  been  discarded.  The  EMGB  maintains  a  registry  of  this  phenotypic 
information.  The  EMGB  can  be  used  to  identify  polymorphisms  in  genes  that  are 
potentially  of  interest  to  environmental  medicine  and  to  obtain  an  estimate  of  the 
frequency  of  these  polymorphisms  in  young,  healthy  U.S.  adults  because  of  the 
ethnically  diverse  and  geographically  dispersed  backgrounds  of  the  donors. 

Additionally,  this  resource  also  serves  as  a  valuable  source  of  control  material  for 
genetic  studies  of  human  diseases,  such  as  asthma.  The  project  is  performed  as  part 
of  a  cooperative  research  and  development  agreement  (CRDA)  with  the  Division  of 
Pulmonary  and  Critical  Care  Medicine  at  Brigham  and  Women’s  Hospital. 


B-lymphocytes  immortalized  using  the  Epstein  Barr  Virus  (EBV)  are  incorporated 
into  the  EMGB  in  attempts  to  maintain  stocks  of  genetic  material  that  are  characterized 
by  phenotype.  These  samples  include  cells  and  DNA  from  asthmatics,  as  well  as  from 
characterized  non-asthmatics. 


This  report  provides  updated  information  about  the  samples  currently  stored  in 
the  EMGB.  It  is  intended  as  a  reference  document  for  researchers  who  wish  to  make 
use  of  this  resource,  and  fulfills  the  annual  reporting  requirement  of  CRDA  number 
DAMD  17-00-0017. 


INTRODUCTION 


Based  on  previous  reports,  it  seems  likely  that  there  is  a  significant  genetic 
contribution  to  some  aspects  of  human  physical  performance  (1-5;7;9;13)  and  to  the 
susceptibility  to  environmental  illness  and  injury.  However,  very  few  candidate  g^nes  ^ 
have  been  identified,  in  part  because  few  laboratories  have  access  to  large  populations 
of  well-characterized  subjects  drawn  from  a  wide  variety  of  genetic  backgrounds.  The 
U.S.  Army  Research  Institute  of  Environmental  Medicine  (USARIEM)  is  uniquely 
qualified  to  undertake  a  search  for  these  genes,  by  virtue  of  its  access  to  Army 
personnel  and  its  ability  to  define  precisely  those  phenotypes  relevant  to  environmental 
illnesses  and  human  performance. 


Large  numbers  of  samples  are  typically  needed  to  identify  genes  that  contribute 
to  complex  traits  (2).  Accordingly,  the  Environmental  Medicine  Genome  Bank  (EMGB) 
banks  DNA  samples  obtained  from  donor  white  blood  cells  and  catalogues  phenotypic 
information  obtained  over  the  course  of  multiple  approved  studies.  By  pooling  samples 
and  data  from  several  studies,  it  becomes  possible  to  undertake  genetic  analyses  that 
would  otherwise  not  be  feasible. 


The  EMGB  serves  as  an  Institute  resource,  and  anonymous  aliquots  from  the 
bank  are  available  to  individual  investigators  upon  request.  This  document  summarizes 
the  current  contents  of  the  bank. 

DNA  samples  of  the  EMGB  entered  previously  were  purified  from  terminally 
differentiated  cell  lines  and  were  non-renewable  sources  of  genetic  material. 
Immortalized  lymphocytes  have  been  added  to  the  EMGB  so  the  cells  can  be  grown  at 
any  time  to  extract  the  DNA.  These  cell  lines  represent  a  renewable  source  of  genetic 
material.  The  samples  of  the  EMGB  that  have  not  been  immortalized  will  eventually  be 
irreplaceably  consumed. 
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MATERIALS  AND  METHODS 


VOLUNTEERS 


With  the  exception  of  study  #1  (normal  controls)  and  study  #7,  subjects  wgre 
participants  in  other  USARIEM  studies  of  environmental  medicine  and  physical 
performance  (Table  1).  All  subjects  gave  consent  in  accordance  with  45  CFR  46.  In 
study  #1 ,  subjects  were  recruited  directly  for  the  purpose  of  creating  a  core  cohort  of 
DNA  samples  from  anonymous,  young,  otherwise  healthy  volunteers.  In  study  #7,  the 
56  immortalized  samples  were  obtained  through  approved  Brigham  and  Women’s 
Hospital  studies  using  material  that  otherwise  would  have  been  discarded.  These 
samples  are  classified  by  asthmatic  phenotypes  such  as  FEV1,  %FEV  and  skin  testing 
for  atopy. 


From  each  volunteer  20  ml  of  blood  were  drawn  into  10  ml,  heparin-containing 
tubes.  Samples  drawn  from  locations  remote  from  the  analytical  laboratory  were 
shipped  overnight  (at  room  temperature)  to  the  laboratory  for  processing. 


DNA  ISOLATION  AND  STORAGE 


DNA  is  obtained  from  leukocyte  nuclei  after  erythrocyte  lysis,  using  the  QIAamp 
Maxi  Kit  (Qiagen,  Inc.,  Santa  Clara,  CA).  The  isolated  DNA  is  stored  in  aqueous 
solution  (in  water),  at  a  concentration  of  35-150  ng  /  pi  as  determined  by  UV  absorption 
at  260  nm. 


The  EMGB  includes  both  samples  obtained  from  white  blood  cells  (which  are  not 
renewable)  and  immortalized  B  lymphocytes.  The  DNA  from  immortalized  B 
lymphocytes  were  purified  without  nuclei  isolation  and  therefore  may  include 
mitochondrial  DNA.  The  DNA  from  these  samples  have  been  standardized  to  50  ng  /  pi 
dilutions  and  tested  using  CLONTECH’s'p'^  Actin  PCR  primers  (Cat#  5402),  as  outlined 
in  the  rn^nufacture’s  instructions.  This  is  performed  to  ensure  constant  quality  of  the 
template  DNA  during  genetic  studies.  All  future  samples  will  meet  this  standard  before 
aliquots  are  produced  or  dispensed. 


ALIQUOTS 


The  EMGB  was  replenished  by  diluting  the  master  DNA  samples  to  50  ng  /  uL, 
unless  otherwise  labeled,  and  dispensed  in  25  uL  aliquots.  To  minimize  damage  from 
repeated  freeze-thaw  cycles,  each  sample  is  divided  into  a  master  sample  and  several 


aliquots  at  the  time  of  original  isolation.  At  present,  all  samples  are  maintained  at  -80°C. 
Aliquots  are  used  until  exhausted.  The  master  samples  are  thawed  only  when 
aliquots  are  needed. 


LYMPHOCYTE  IMMORTALIZATION  PROTOCOL  ^ 

Immortalization  Solution 


The  Epstein  Barr  Virus  (EBV)  Infection  Solution  is  made  from  EBV-infected 
marmoset  leukocyte  cells.  Marmoset  cells  produce  viable  virus,  which  is  shed  into  the 
cell  supernatant  of  the  growth  media  (8).  The  supernatant  is  considered  to  be  potent 
after  the  cells  have  been  starved  for  approximately  1  week  and  when  the  phenol  red  pH 
indicator  in  the  media  has  turned  yellow.  The  cell  suspension  is  collected  in  a  50  rnl 
centrifuge  tube  and  centrifuged  at  400  x  g  for  1 0  min.  The  supernatant  is  filtered  with  a 
0  2  pm  filter  to  ensure  the  removal  of  marmoset  cells.  This  virus  containing  filtrate 
solution  is  the  infection  solution  that  can  be  stored  at  4°C  for  up  to  3  months,  or  can  be 
frozen  at  -80°C  for  long-term  storage. 


PBMC  Purification 


Subject  peripheral  blood  monocytes  (PBMC)  are  purified  for  immortalization.  20 
ml  of  room  temperature,  heparin  treated  blood  is  added  to  20  ml  of  Hank’s  balanced  salt 
solution  (HBSS)  in  a  50  ml  conical  centrifuge  tube.  Contents  are  gently  mixed  to  make 
a  homogenous  suspension.  Fifteen  (15)  mL  of  Histopaque  - 1077  (Sigma  cat#  10771) 
is  added  to  two  separate  50  mL  Falcon  centrifuge  tubes.  Histopaque  is  a  density 
gradient  solution  that  separates  blood  into  two  cell  types:  red  blood  cells  (RBC)  and  _ 
PBMC.  The  blood-HBSS  solution  is  slowly  layered  on  top  of  each  Histopaque  solution. 
A  sharp  interface  should  be  seen  between  the  two  solutions.  The  tubes  are  centrifuged 
at  400  X  g  for  30  min  with  the  brake  left  off.  Centrifugation  results  in  four  distinct  density 
layers:  plasma,  PBMC,  Histopaque  1077,  and  RBC.  The  second  layer,  the  PBMC,  is 
the  buffy  coat  layer  that  contains  the  cejl^required  to  produce  the  immortalized  cell 
lines.  This  layer  is  extracted  and  pipetted  into  a  separate  50  mL  centrifuge  tube  and 
washed  with  HBSS.  Cells  are  then  centrifuged  at  400  x  g  for  10  min,  the  supernatant  ^ 
decanted,  and  the  washing  step  repeated.  Expected  recovery  is  approximately  1x10 
cells  per  mL  of  blood  drawn. 


Immortilization  of  PBMCs 


The  PBMC  are  suspended  in  2  ml  of  EBV  Infection  Solution  and  the  vented  tube 
is  placed  in  the  incubator  at  37®C  for  1-2  hours.  2  ml  of  media  (RPMI  w/  L-Glutamine, 
Penn-Strep,  and  20%  heat-inactivated  FBS)  is  added  to  the  cell  suspension.  1  ml 


aliquots  are  added  into  four  wells  of  a  treated  24-well  culture  plate.  Cell  are  placed  at 
37°C  in  5%  CO2  for  4  days  and  transferred  to  T-25  or  T-75  culture  flasks  at  a 
concentration  of  0.5x10®  cells  per  milliliter. 


The  immortalized  cell  lines  are  stored  in  liquid  nitrogen  in  a  step-wise  fashion  at 
concentrations  of  5  x  10®  cells  per  milliliter  in  DMSO  Cell  Freezing  Media  by  Sigma  _ 
(Catalog  #  C-6164).  Cells  are  first  stored  at  -20°C  for  2  hours,  then  -80°C  for  12  hours 
before  finally  transferring  the  cells  to  liquid  nitrogen. 


PROCEDURES  FOR  INSURING  CONFIDENTIALITY  AND  ANONYMITY  OF  DATA 

There  are  several  layers  of  protection  of  confidentiality:  First,  the  only  temporary 
link  between  an  individual  and  his  EMGB  number  is  retained  in  hard  copy  format  in  a 
locked  cabinet.  No  electronic  links  between  individual  identifiers  and  EMGB  numbers 
are  retained.  Second,  the  hard-copy  data  are  ordered  by  study  and  EMGB  number,  not 
by  last  name.  Third,  the  EMGB  database  is  on  a  freestanding  (non-networked), 
password-protected  computer.  Fourth,  many  of  the  genotypes  in  the  EMGB  are 
themselves  in  shorthand  notation,  with  the  actual  sequences  they  represent  kept  in 
laboratory  notebooks  or  other  files.  Fifth,  once  no  longer  needed  for  purposes  of  data 
validation,  the  individual  identifiers  on  the  data  sheets  are  defaced  so  as  to  permanently 
destroy  the  ‘hard  copy’  link  between  individuals  and  their  information. 

The  specific  procedures  for  insuring  confidentiality  and  anonymity  are  as  follows: 

a.  Samples  are  collected  at  the  study  site  and  labeled  according  to  the 
individual  study  guidelines.  At  the  same  time,  the  volunteer  and  PI  fill  out 
an  EMGB  data  collection  form  (attached). 

b.  The  samples  and  collectipn  forms  are  shipped  to  the  EMGB  laboratory 
site.  Upon  arrival,  each  sample  and  its  accompanying  data  form  are 
assigned  a  unique  EMGB  number,  which  is  written  on  the  sample  and  on 
the  form. 

c.  The  forms  are  secured  in  a  locked  cabinet  and  constitute  the  only  possible 
link  between  a  subject  and  his  or  her  genotypic  information.  The  forms 
are  retained  to  make  sure  original  data  are  available  to  reconstruct  or 

.  repair  the  electronic  databases  in  the  event  of  data  corruption,  loss,  or 

mis-entry. 

d.  Data  from  the  forms  are  stored  electronically  as  follows: 

i.  All  phenotypic  data,  except  for  names  and  social  security  numbers, 
are  stored  in  an  electronic  database.  These  data  are  linkable  to 
genotype  data  by  means  of  the  EMGB  number. 

ii.  To  prevent  individuals  from  donating  more  than  one  sample  to  the 
EMGB,  a  separate  electronic  file  containing  nothing  but  names  and 
social  security  numbers  is  maintained.  This  file  is  checked  before 
any  new  samples  are  entered  into  the  EMGB  system.  This  file  is 
NOT  linked  to  EMGB  numbers. 


Genotype  data  is  stored  electronically  and  linked  to  phenotypic  data 
through  the  EMGB  numbers. 

All  electronic  data  is  stored  on  a  password-protected,  non-networked 
computer.  Backup  copies  of  the  data  are  stored  in  a  locked  cabinet. 

Once  no  longer  needed  for  data  validation,  the  individual  identifiers  on  the 
data  sheets  are  defaced  (blacked  out  with  a  magic  marker)  so  as  to 
destroy  any  potential  ‘hard  copy’  link  between  individuals  and  their  EMGB 
information. 


RESULTS 


CONTRIBUTING  STUDIES  AND  SAMPLE  USE 


Studies  that  have  contributed  samples  to  the  EMGB  and  the  current  inventoiy  of 
samples  are  listed  in  Table  1 .  This  table  also  lists  some  of  the  phenotypic  information 
available  and  summarizes  some  of  the  genotypic  information  that  has  been  obtained  on^t- 
the  samples  to  date.  So  far,  7  studies  have  contributed  samples  to  the  EMGB.  DNA 
was  obtained  from  most  (but  not  all)  donated  samples,  and  some  samples  (especially 
those  with  low  DNA  yields)  have  been  used  in  their  entirety.  At  present,  the  bank 
contains  DNA  samples  from  343  different  donors. 


Through  the  use  of  the  EMGB  during  the  past  three  years,  the  total  DNA  within 
the  bank  has  diminished.  Eighteen  master  aliquots  from  the  original  283  terminal 
samples  have  been  consumed.  The  changes  in  the  total  number  of  available  samples 
during  the  past  years  are  presented  in  Table  2.  Sixteen  samples  have  been  consumed 
and  are  no  longer  available  for  study.  Twenty  samples  have  5  or  less  aliquot  available 
for  study  Sixty  samples  have  been  added  to  the  EMGB  in  2001  that  have  their  B- 
lymphocytes  immortalized.  These  represent  a  renewable  supply  of  DNA  that  will  not  be 
completely  consumed  as  long  as  the  cell  lines  are  maintained. 


O  , 
0)  O 
O  TJ 
Q.® 

E  8 

-J  (0 

•o  E 

0)  SI 

M  w 
ro  < 

I « 

If 

aI 


ca 

Q. 

o 

O  (0 

‘•S 

(u  ^ 


Table  1  (Continued) 

Key  to  Available  Phenotypic  Information 

1 .  Age,  race  and  gender 

2.  Smoking  status  — 

3.  History  of  asthma  or  exercise-induced  bronchospasm 

4.  Spirometry  data 

5.  Spirometry  before  and  after  exercise 

6.  Army  Physical  Fitness  Test  scores 

7.  Army  Physical  Fitness  Test  scores  before  and  after  basic  training 

8.  Oxygen  saturation  with  increasing  altitude 

9.  Height  and  weight 

Key  to  Genotypes 

ACE:  Angiotensin  Converting  Enzyme  Insertion/Deletion  Polymorphism,  Intron  16 
NOS1:  Neuronal  Nitric  Oxide  Synthase  CA  repeat  Polymorphism,  Exon  29 
EOT:  Eotaxin  23ALA  ->  23  THR  Polymorphism 

MCP-4:  Monocyte  Chemo  attractant  Protein  4,  Chromosome  17q1 1.2  promoter 
mutation 

Gal-3:  Galectin  3,  Chromosome  14,  Exons  3  and  6 

Cats:  Cathepsin  S,  Chromosome  1 

NOS3:  Nitric  Oxide  Synthase  3,  Chromosome  7 

IL-18:  Interleukin  18,  Promoter  Region,  Chromosome  2q12 

C3a:  Complement  Cascade  Protein _ _ _ _ 


Table  2.  EMGB  Sample  Availability 

Year 

^00 

2001 

2002 

Terminal  Samples  Banked  to 
Date 

283 

283 

283 

Samples  Exhausted  to  Date 

0 

15 

16 

Immortalized  Samples 
Banked  to  Date 

0 

60 

60 

Total  Samples  Available 

283 

328 

327 
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Three  peer-reviewed  manuscripts  using  samples  from  the  EMGB  have  been 
published  over  the  past  three  years  (Table  3).  Angiotensin  Converting  Enzyme  (A0E), 
nitric  oxide  synthase  CA  (NOS1),  and  the  eotaxin  (EOT)  genotypes  have  been 
examined  in  population  studies  using  the  EMGB.  The  Angiotensin  Converting  Enzyme 
I/D  polymorphism  in  intron  16  has  been  implicated  by  some  as  a  marker  of  physical  ' 

performance  (5:7;9),  though  others  have  questioned  this  association  (1 1  ;12).  Wa. 
concluded  that  the  ACE  genotype  does  not  have  a  strong  effect  on  aerobic  power  or  f 
muscular  endurance  in  healthy,  young  American  adults  drawn  from  an  ethnically  and 
geographically  diverse  population  (14).  The  neuronal  NOS1  CA  repeat  polymorphism  in 
exon  29  has  been  found  to  be  associated  with  asthma,  in  a  study  to  which  the  EMGB 
contributed  samples  (6).  The  EMGB  was  also  used  to  determine  the  frequency  of  a 
novel  mutation  In  the  eotaxin  gene  that  limits  eotaxin  secretion  (10). 


Table  3.  Publications  to  date  that  have  used  the  EMGB 

1.  Dikeidek,  M.,  A.  Bastille,  LA.  Sonna,  and  C.M.  Lilly.  Environmental  Medicine 
Genome  Bank  (EMGB):  Hardy-Weinberg  equilibrium  at  an  eotaxin  locus  on 
chromosome  17.  Technical  Report  T-02/14.  USARIEM,  Natick,  MA,  2002. 

2.  Nakamura,  H.,  A.  Luster,  T.  Nakamura,  K.  Ho,  L.  Sonna,  A.  Deykin,  E.  Israel,  J. 
Drazen,  and  C.  Lilly.  Variant  eotaxin:  Its  effects  on  the  asthma  phenotype. 
J.AIIergyClin.lmmunol.  108(6):  946-53,  2001 

3.  Sonna,  L.  A.,  M.  A.  Sharp,  J.^J.  Knapik,  M.  Cullivan,  K.  C.  Angel,  J.  F.  Patton, 
and  C.  M.  Lilly.  Angiotensin-converting  enzyme  genotype  and  physical 
performance  during  US  Army  basic  training.  J.Appl.Physiol  91: 1355-1363,  2001; 

4.  Grasemann,  H.,  C.  Yandava,  K.  Storm  van's  Gravesande,  A.  Deykin,  A.  Pillari,  J. 
Ma,  L.  Sonna,  C.  Lilly,  M.  Stamp%^E.  Israel,  E.  Silverman,  and  J.  Drazen.  A 
neuronal  NO  synthase  (NOS1)  gene  polymorphism  is  associated  with  asthma. 
Biochem.Biophys.Res.Commun.  272:  391-394,  2000. 
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DISCUSSION 


The  EMGB  consists  of  DNA  samples  obtained  from  an  ethnically  diverse  and 
geographically  dispersed  population  of  subjects.  This  diversity  makes  the  EMGB  a 
valuable  resource  for  several  types  of  genetic  studies.  At  present,  we  envision  three 
principle  uses  for  the  bank.  First,  given  a  gene  known  or  suspected  to  be  of  interest  to 
environmental  medicine,  the  EMGB  can  be  used  to  identify  new  polymorphisms  in  this 
gene  and  to  obtain  an  estimate  of  the  frequency  of  these  polymorphisms  in  young, 
healthy  U.S.  adults.  Because  the  information  collected  in  the  EMGB  includes  both 
ethnic  origin  and  gender,  it  is  also  possible  to  compare  allele  frequencies  across 
important  demographic  subgroups.  Second,  the  EMGB  is  a  source  of  control  material 
for  genetic  studies  of  human  diseases,  such  as  asthma.  Third,  some  of  the  donor 
phenotypes  in  the  EMGB  (particularly  those  from  study  #2)  are  characterized  well 
enough  to  allow  genetic  association  studies. 


Previously,  one  significant  limitation  of  the  EMGB  was  that  sarnples  were  not 
easily  renewed.  Now  that  techniques  have  been  adopted  to  immortalize  lymphocytes, 
anonymous  samples  from  individual  donors  can  provide  a  renewable  source  of  DNA. 


In  summary,  the  current  heterogeneity  of  the  EMGB  makes  it  a  valuable  resource 
of  anonymous  samples  for  genetic  research.  It  has  already  proven  to  be  of  value  in 
collaborative  studies  of  human  disease  (6;10)  and  used  to  examine  the  genetic  basis  of 
physical  performance  (14).  The  addition  of  technology  to  allow  source  DNA  to  be 
renewed  has  greatly  enhanced  the  longevity  and  potential  value  of  this  resource. 
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